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(57) The invention relates to a method and apparatus for 
scheduling transmission link bandwidth between 
packet-switched data flows so that a desired CoS (Class of 
Service) is provided with an ability to utilize the 
instantaneously available bandwidth of a data transmission 
network while simultaneously offering a guaranteed 
minimum data rate (Guaranteed Data Rate and Best Effort) 
without compromising the operation of such classes that 
have no guaranteed lower bound of data rate, but instead, 
have the service implemented by the utilization of the 
instantaneously available bandwidth (Best Effort). The 
invention is based utilizing in the scheduler control, not only 
the information indicating the class of service, but also the 
information indicating the subgroup inside a class of service 
(e.g., drop precedence). The information indicating the 
subgroup is conventionally utilized only for congestion 
control purposes. 
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Method and apparatus for scheduling available link bandwidth between packet- 
switched data flows 

The invention relates to a method according to claim 1 for scheduling available link 
bandwidth between packet-switched data flows. 

Hie invention also relates to an apparatus according to claim 5 for scheduling avail- 
able link bandwidth between packet-switched data flows. 

In the following text elaborating both the prior art and the present invention, the 
following abbreviations will be used: 

BE Class of service for applications that are allowed to utilize the instanta- 
neously available bandwidth of a network but are not guaranteed to a 
minimum data rate nor an upper bound for packet transfer delay and delay 
jitter (Best Effort), 

CoS Class of Service, 

DSCP Packet header information indicating class of service of a packet (Differen- 
tiated Services Code Point), 
FIFO First In First Out queuing discipline, 

G+E Class of service for applications that are allowed to utilize the instantane- 
ously available bandwidth of a network and are guaranteed to a minimum 
data rate but not to an upper bound for packet transfer delay and delay jitter 
(Guaranteed Rate and Best Effort), 

QoS Quality of Service, 

RT Class of service for applications for which the packet transfer delay and 

delay jitter is minimized and for which the minimum data rate is guaranteed 
but the applications cannot utilize the instantaneously available bandwidth 
of a network (Real Time), 
SFQ Start-time Fair Queuing, a weighted queuing discipline [1], 
wfq Weighted queuing discipline, the abbreviation being used as a generalized 
concept (weighted fair queuing), 
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WFQ Weighted Fair Queuing, a specific weighted queuing discipline [1 J, 
WRED Weighted congestion avoidance algorithm [3, 4] (Weighted Random Early 
Detection). 

5 In packet-switching networks, it is often advantageous to classify the data packets to 
be transferred into different classes of service (CoS) on one hand by the needs of 
various applications using the data network services and, on the other hand, by the 
QoS level agreements of a telecom service provider with his customers. In conjunc- 
tion with a conventional telephone connection, for instance, it is essential that the 
10 bandwidth required by the application is available for predetermined time with a 
sufficiently low data transfer delay and delay jitter. In a telephone application, the 
user gains nothing from the possibility of having access to a temporarily higher link 
bandwidth in a low-load situation of the network. In contrast, during the download of 
a www page, for instance, it would be extremely advantageous to have access to the 
15 rail temporarily available bandwidth of the network. 

Next, a situation examined in which the telecom service provider offers the following 
classes of service: 
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- RT (Real Time): service class for applications that are guaranteed to a minimum 
data rate and the packet transfer delay and delay jitter are minimized without any 
attempt to increase the instantaneous data rate offered to the given application 
even when the traffic load imposed on the communications network would 
momentarily happen to be at low level. 



- G + E (Guaranteed Rate and Best Effort): service class for applications guaran- 
teed to a given minimum data rate and additionally offering all the instanta- 
neously available bandwidth of the data transmission system to the use of the 
application. However, no commitments to guaranteed upper bounds of the packet 

30 transfer delay and delay jitter are given. 

- BE (Best Effort): service class for applications allocated to utilize the instanta- 



neously available bandwidth of a network without any guaranteed minimum data 
transfer rate. Neither are commitments made as to any upper bounds given for the 
packet transfer delay and delay jitter. 

FIG. 1 shows a conventional arrangement for scheduling the bandwidth of a common 
data transmission link between data flows representing the above-listed classes of 
service. The function of the system shown in FIG. 1 is as follows: 

- The class of service assigned to a given package is identifiable by header 
information carried in the packet (e.g., DSCP, Differentiated Services 
Code Point [2]). 

- The received packets are scheduled to respective class-of-service specific 
FIFO queues (RT, G+E and BE queues). 

- Each one of the packets categorized in class of service G+E is further 
assigned to an internal subgroup of the CoS allowing at least a decision 
to be made whether the packet is belonging to the portion of traffic com- 
mitted to the guaranteed minimum data rate (hereinafter G portion) or to 
the portion of traffic exceeding the guaranteed minimum rate (hereinafter 
E portion). Assigning a packet to a given subgroup can be indicated, e.g., 
by the priority information (drop precedence) carried in the DSCP [2]. 
The subgroup information is utilized when a decision must be made at a 
case of queue congestion to which packets the actions of congestion 
control policy shall be applied. An example of this approach is the 
WRED (Weighted Random Early Detection) congestion control method 
[3,4]. 

- The link bandwidth is scheduled for data flows of the RT queue 1, the 
G+E queue 5 and the BE queue 6 using a weighted scheduling discipline 
(e.g., SFQ [1]) such that the weight (Wrj) of the RT queue 4 is selected 
so large in regard to the weights (Wou and W BE ) of the G+E and BE 
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queues that the traffic of the class RT under all conditions has access to 
the minimum bandwidth allocated thereto, while the weight of the G+E 
queue 5 is selected so large in regard to the weight of the BE 6 queue that 
the traffic of the class G+E under all conditions is granted to have access 
to the guaranteed minimum data rate. 

- Traffic of the class RT and the G portion of the class G+E are assumed to 
be bandwidth limited prior to the scheduler. 

FIG 2 shows another conventional arrangement for scheduling the bandwidth of a 
common link between data flows representing the above-listed classes of service. 
The function of the system shown in FIG 2 differs from the function of the system 
shown in FIG. 1 by having the link bandwidth scheduled for the RT queue 4 at a 
priority before the G+E 5 and BE queues 6. The use of a priority scheduling 
discipline for the RT queue 4 is possible inasmuch as the traffic of the RT queue 4 is 
assumed to be bandwidth limited prior to entering the input port of the scheduler. 

A problem in the scheduling arrangements shown in FIGS. 1 and 2 is that the traffic 
of class BE is contested in the scheduler by the traffic flow of the E portion in class 
G+E with weight whose value in regard to weight W BB of class BE is selected 
based on the guaranteed minimum data rate (guaranteed for the G portion) of class 
G+E. Resulting^ class BE has a poor capability of utilizing the instantaneously 
available bandwidth when also simultaneously the traffic flow of the E portion in 
class G+E attempts to utilize the same instantaneously free link bandwidth. This, 
however, is contrary to the basic idea of class BE traffic that guarantee no lower 
bound for the data transfer rate but instead provide the service user full access to the 
utilization of the instantaneously available bandwidth. 

The situation is elucidated by exemplary cases (a) and (b) illustrated in FIG. 3. In the 
diagram, exemplary case (a) corresponds to bandwidth sharing between traffic flows 
of different classes of service when traffic from each class of service is transmitted in 
maximum possible amount Herein, the ratio of bandwidth amount (Bo+e) used by 
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the traffic of class G+E to the bandwidth amount (B BE ) used by class BE is Wq+e 
/W be . Exemplary case (b) corresponds to bandwidth sharing between data flows of 
different classes of service when the bandwidth reservations for both the traffic of 
class RT and the G portion of class G+E are the same as in exemplary case (a) with 

s the exception that the fraction of bandwidth used by the traffic of class RT is less 
than the bandwidth reserved for the class, while the traffic of classes G+E and BE is 
transferred as much as possible. Also in this situation, the bandwidth utilization ratio 
is Bo+b/Bbe = Wg+b/Wbb. As is obvious from exemplary case (b), the fraction of 
bandwidth remaining unutilized by the traffic of class RT is given almost entirely to 

10 the E portion of class G+E. 

It must be noted that, since the scheduler is not allowed to change the forwarding 
order of packets in the traffic flow of class G+E, the G and E portions of class G+E 
cannot be separated into different queues that could be given mutually independent 
is scheduling weights. 

It is an object of the present invention to overcome the drawbacks of the above- 
described prior art and to provide an entirely novel type of method and apparatus for 
scheduling the instantaneously available bandwidth between different packet- 
20 switched data flows. More particularly, the invention relates to a method capable of 
implementing a scheduler such that the instantaneously available bandwidth is 
allocated in a desired ratio (e.g., 1 : 1) between the traffic flows of E portion in class 
G+E and the traffic flow of class BE. 

25 The goal of the invention is achieved by virtue of employing the subgroup informa- 
tion (e.g., drop precedence) in the operational control of a scheduler. In the prior art, 
subgroup information has only been used in a congestion control system (e.g., 
WRED). Yet, the scheduling method according to the invention does not exclude the 
use of subgroup (e.g., drop precedence) information in a congestion control system, 

30 tOO. 

More specifically, the method according to the invention is characterized by what is 
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stated in the characterizing part of claim 1. 

Furthermore, the apparatus according to the invention is characterized by what is 
stated in the characterizing pact of claim 5. 

5 

The invention offers a significant benefit over the prior art by way of allowing the 
scheduling engine to be implemented in such a fashion that the available residual 
bandwidth is allocated in a desired ratio (e.g., 1:1) between the data flow of the E 
portion in class G+E and the data flow of class BE. As a result, it becomes possible 

10 to provide a class of service (G+E) such that the instantaneously available bandwidth 
of a data transfer network can be utilized and, simultaneously, a guaranteed 
minimum data rate can be assured without compromising the quality of service in 
such classes (e.g., BE) that have no guaranteed lower bound of data transfer rate, but 
instead, have the service implemented by utilization of the instantaneously available 

15 bandwidth. 

In the following, the invention is described in more detail with reference to exempli- 
fying embodiments by making reference to the appended drawings in which 

20 FIG 1 shows a block diagram of a prior-art system for scheduling the bandwidth of a 
common data transmission link for data flows of the above-discussed classes of 
service (RT, G+E, BE); 

FIG 2 shows a block diagram of another prior-art system for scheduling the band- 
25 width of a common data transmission link for data flows of the above-discussed 
classes of service; and 

FIG 3 shows two exemplary cases (a) and (b) of the division of the instantaneously 
available bandwidth between the data flows of different classes of service. In exem- 
30 plary case (a), the maximum amount of traffic per each class of service is transferred. 
In exemplary case (b), the fractions of bandwidth reserved for the class RT and the G 
portion of class G+E, respectively, are the same as those in exemplary case (a) but 



the bandwidth used by the traffic of class RT is less than the upper bound of 
bandwidth reserved for the class, while the traffic of classes G+E and BE is 
transferred at maximum bandwidth; and 

FIG. 4 shows a block diagram of a system according to the invention for scheduling 
the bandwidth of a common data transfer link between data flows of the above- 
discussed classes of service. 

The theoretical basics of the method according to the invention are elucidated in the 
subsequent description. 

In a weight-based scheduling method, the packets received at the input port of the 
scheduler are marked with a forwarding order indicator (e.g., Start.tag in the SFQ 
method [1]) telling the instant at which the packet is scheduled forward. Hence, the 
first packet to be forwarded is the one having an order indicator with a value indi- 
cating the earliest instant of forwarding. The transmission order indication need not 
be synchronized with the real time, but rather, it is sufficient to have the forwarding 
indicators of the packets in a correct transmission order in regard to each other. 

In the generation of the forwarding order indicator for a packet received from a given 
class of service queue, the packet weight is assigned according to the respective class 
of service. If queue Jl has a higher weight than queue J2, the forwarding indicator 
sequence of successive packets of queue Jl in regard to the respective forwarding 
indicator sequence of queue J2 has such a character that queue Jl gains a larger 
fraction of scheduler output capacity. 

In a priority-based sequencing method, a priority value is assigned to each one of the 
packets received at the scheduler 1 input port. The packets' priority values determine 
which one of the packets is to be forwarded next 

In the method according to the invention, however, the priority value assigned to a 
packet or, respectively, the weight applied to the generation of the packet's forward- 
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ing order indicator is dependent, not only on the class of service of the packet 
(hereinafter designated by symbol q), but also on the subgroup information 
(hereinafter designated by symbol p, such priority information being, e.g., packet 
drop precedence [2]) of the packet in question and/or of packets preceding or 
following the packet in question in the same class of service, FIG. 4. Inasmuch as the 
packets are contained by class of service specific queues in front of the sequencer 1 , 
it is thus possible to know within the limits determined by the number of packets 
contained by the queue what kind of subgroup information is carried in the packets 
destined to enter the scheduler in the near future. 

hi the method according to the invention, an item or plural items of subgroup inform- 
ation may also determine whether the scheduling decisions concerning a given 
packet are made using a weight-based or a priority-based scheduling mechanism. 

In contrast, prior-art systems employ subgroup information (p) for congestion control 
operations but not for scheduling. 

Next, an embodiment of a scheduler according to the invention is described as to its 
scheduling function of the traffic flows of classes G+E and BE using an SFQ 
algorithm [1]. In the exemplary embodiment of the invention discussed herein, the 
packet-specific weight is selected on the basis of the subgroup whereto the packet 
under consideration belongs. The forwarding order indicators (So*(i) and S BE (/)) of 
packet i in class G+E and packet, in class BE. respectively, are computed as follows: 

So+eCO = max { v, Sou 0-1) + L(i-1) / W(q, p)}, (1) 
Sbe(/) = max { v, S BE (/-l) + L(f-1) / Wfo, p)), (2) 

wherein L(M), Ltf-l) are the packet sizes in bytes, for instance, variables p and q 
determine the value of weight W such that variable q is dependent on the class of 
service (G+E or BE) assigned to the packet (i otj) being examined and variable, is 
dependent on the subgroup assigned to the packet (« or J) being examined, and v is 
the forwarding order indicator (virtual time stamp) of the packet being transferred. 



The value of the forwarding order indicator is computed when the packet is received 
at the class of service -specific input port of the scheduler and will not be updated 
later even if the value of v should change. Of the inbound packets, the first to be 
forwarded is the one (i or j) having the lower value of forwanling indicator. 

In the exemplary case discussed herein is assumed that the subgroup-specific weights 
are selected as follows: 

if packet of class G+E belongs to G portion, W(q =*G+E\ p = 'G') = W G , 
if packet of class G+E belongs to E portion, W(q ='G+E\p = *E') = W E , 
packets of class BE have the identical weights irrespective of the subgroup 
information, that is, Wfo ='BE'; p: irrelevant) = W BB . 

Hereinafter, a simple test or simulation is sufficient to verify the following fact: if 
over a given period of time the system transfers an average amount W G of the bytes 
(or bits) of G portion packets, then also over the period of time the system transfers 
an average amount W B b of the bytes (or bits) of class BE packets and, respectively, if 
over a given period of time the system transfers an average amount W B of bytes (or 
bits) of E portion packets, then also an average amount W B b of the bytes (or bits) of 
class BE packets are transferred. To further simplify the situation, it is possible to 
assume all packets to have an equal size, whereby the above-stated is true, not only 
for the bytes of packets, but also for complete packets. 

By a suitable selection of values for weights W E and W BE , a scheduling machine may 
be implemented so that the available bandwidth is allocated in a desired ratio 
between the data flow of the E portion in class G+E and the data flow of class BE. 

An alternative embodiment of the above-described case can be realized by assigning 
weight W 0 an infinite value. In practice this means that packets of G portion are 
scheduled priority-based instead of using an SFQ discipline. Then, a packet received 
at the scheduler input port assigned for packets of subgroup G in class G+E queue is 
forwarded in prioritized fashion whatever the forwanling order indication of the 
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packet queuing at the input port serving the data How of class BE. This is feasible 
inasmuch as the traffic of the G portion in class G+E is assumed to be bandwidth 
limited. 

5 
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What is claimed is: 

1. A method for scheduling link bandwidth between different packet-switched data 
flows, the method comprising the steps of 

- digital data is transferred in the form of fixed- or variable-length packets, 

- the packets are marked with identifier information allowing the packets 
to be classified into at least two classes of service, 

- on the basis of the information indicating the class of service, each one of 
the inbound packets are individually classified into one of class of service 
-specific parallel FIFO queues (4, 5, 6), the number of the queues being 
one per each class of service, 

- at least one class of service is having its packets tagged with identifier 
information allowing the packets to be classified into at least two internal 
subgroups within said class of service, 

- packets of a given class of service form a data flow, wherein the forward- 
ing order of the packets is retained irrespective of the subgroup-defining 
identifier information carried in the packet, and 

- the available bandwidth of the outbound link or links of the system is 
scheduled (1) between said class-of-service specific FIFO queues using 
a weight-based scheduling discipline, a priority-based scheduling 
discipline or a combination thereof, 

characterized in that the packet-specific priority value in the priority-based schedul- 
ing discipline and/or the weight in the weight-based scheduling discipline is deter- 
mined by using the combined effect of variables q and />, whereby the value of 
variable q is dependent on the class of service (CoS) assigned to the data flow 
transferred by the packet in question and the value of variable p is dependent on the 
subgroup (e.g., drop precedence) whereto the packet in question belongs and/or on 
the classification into subgroups of the inbound packets of the same class of service 
that are received at the scheduler input port preceding or following the packet in 
question. 
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2. The method of claim 1. characterized in that the selection between the use of a 
weight-based or a priority-based scheduling discipline is made based on the subgroup 
whereto the packet in question belongs and/or how the inbound packets of the same 
class of service received at the scheduler input port preceding or following the packet 
in question are distributed between the subgroups. 

3. The method of claim 1, characterized in that said weight-based scheduling 
discipline is an SFQ (Start-time Fair Queuing [1 J) discipline. 

4. The method of claim 1, characterized in that said weight-based scheduling 
discipline is a WFQ (Weighted Fair Queuing [1]) discipline. 

5. An apparatus for scheduling link bandwidth between different packet-switched 
data flows, the apparatus comprising 

- means for receiving digital data in the form of fixed- or variable-length 
packets, 

- means for reading identifier information carried in said inbound packets 
and allowing said packets to be classified into at least two different 
classes of service, 

- means for individually classifying said inbound packets into at least two 
different classes of service, 

- a FIFO packet queue (4, 5, 6) for each one of said classes of service, 

- means for directing a given packet on the basis of its class of service - 
specific identifier information in the respective class of service -specific 
FIFO queue, 

- means for reading from a given packet its identifier information that 
allows said packet to be classified as to the internal subgroup of the class 
of service assigned to the packet, 

- a scheduler (1) for scheduling the available bandwidth of outbound 
link(s) of the system for ciass-of-service specific FIFO queues (4, 5, 6) 
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using a weight-based scheduling discipline, a priority-based scheduling 
discipline or a combination thereof, and 
- means for sending packets to an outbound link(s) in the forwarding order 
of the packets determined by said scheduler (1), 

characterized in that said apparatus includes means few on per packet basis deter- 
mination of priority value in the priority-based scheduling discipline and/or weight in 
the weight-based scheduling discipline using the combined effect of variables q and 
p, whereby the value of variable q is dependent on the class of service (CoS) 
assigned to the data flow transferred by the packet in question and the value of 
variable p is dependent on the subgroup (e.g., drop precedence) whereto the packet in 
question belongs and/or on the classification into subgroups of the inbound packets 
of the same class of service that are received at the scheduler input port preceding or 
following the packet in question. 

6. The apparatus of claim 5, characterized in that said apparatus includes means for 
decision-making between the use of a weight-based or a priority-based scheduling 
discipline is carried out based on the subgroup whereto the packet in question 
belongs and/or how the inbound packets of the same class of service received at the 
scheduler input port preceding or following the packet in question are distributed 
between the subgroups. 

7. Hie apparatus of claim 5, characterized in that said apparatus includes means for 
carrying out a weight-based scheduling discipline using an SFQ (Start-time Fair 
Queuing [1]) discipline. 

8. The apparatus of claim 5, characterized in that said apparatus includes means for 
carrying out a weight-based scheduling discipline using a WFQ (Weighted Fair 
Queuing [1]) discipline. 
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